A Methodology for Coupling Fragments of XPath with Structural Indexes for XML Documents

نویسندگان

  • George H. L. Fletcher
  • Dirk Van Gucht
  • Yuqing Wu
  • Marc Gyssens
  • Sofia Brenes
  • Jan Paredaens
چکیده

Supporting efficient access to XML data using XPath [3] continues to be an important research problem [6, 12]. XPath queries are used to specify nodelabeled trees which match portions of the hierarchical XML data. In XPath query evaluation, indices similar to those used in relational database systems – namely, value indices on tags and text values – are first used, together with structural join algorithms [1, 2, 19]. This approach turns out to be simple and efficient. However, the structural containment relationships native to XML data are not directly captured by value indices. To directly capture the structural information of XML data, a family of structural indices has been introduced. DataGuide [5] was the first to be proposed, followed by the 1-index [13], which is based on the notion of bi-simulation among nodes in an XML document. These indices can be used to evaluate some path expressions accurately without accessing the original data graph. Milo and Suciu [13] also introduced the 2-index and T-index, based on similarity of pairs (vectors) of nodes. Unfortunately, these and other early structural indices tend to be too large for practical use because they typically maintain too fine-grained structural information about the document [9, 16]. To remedy this, Kaushik et al. introduced the A(k)-index which uses a notion of bi-similarity on nodes relativized to paths of length k [10]. This captures localized structural information of a document, and can support path expressions of length up to k. Focusing just on local similarity, the A(k)-index can be substantially smaller than the 1-index and others. Several works have investigated maintenance and tuning of the A(k) indices. The D(k)-index [15] and M(k)-index [8] extend the A(k)-index to adapt to query workload. Yi et al. [18] developed update techniques for the A(k)-index and 1-index. Finally, the integrated use of structural and value indices has been

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Coupling Fragments of XPath with XML Indexing and Query Decomposition

Recent studies have proposed structural summary techniques for pathquery evaluation on semi-structured data sources. One major line of this research has been the introduction of the DataGuide, 1-index, 2-index, and A(k) indices, and subsequent investigations and generalizations. Another recent study has considered structural characterizations of fragments of XPath, the standard path navigation ...

متن کامل

Trie Indexes for Efficient XML Query Evaluation

As the number of applications that rely on XML data increases, so does the need for performing efficient XML query evaluation. A critical part of the solution involves providing new techniques for designing XML indexes and lookup algorithms. In this paper, we leverage the results of our research on coupling the partitions induced by fragments of XPath algebra and those induced by the structural...

متن کامل

A Structural Approach to Indexing Triples

As an essential part of the W3C’s semantic web stack and linked data initiative, RDF data management systems (also known as triplestores) have drawn a lot of research attention. The majority of these systems use value-based indexes (e.g., B-trees) for physical storage, and ignore many of the structural aspects present in RDF graphs. Structural indexes, on the other hand, have been successfully ...

متن کامل

An XPath Algebraic Characterization of A ( k ) and P ( k ) Indices with Applications to Query Processing

Recent studies have proposed structural summary techniques for path query evaluation on semi-structured data sources. One major line of this research has been the introduction of the DataGuide, 1-index, 2-index, and A(k) indices, and subsequent investigations and generalizations. Another recent study has considered structural characterizations of fragments of XPath, the standard path navigation...

متن کامل

VAMANA : A High Performance, Scalable and Cost Driven XPath Engine

Many applications are migrating or beginning to make use native XML data. We anticipate that queries will emerge that emphasize the structural semantics of XML query languages like XPath and XQuery. This brings a need for an efficient query engine and database management system tailored for XML data similar to traditional relational engines. While mapping large XML documents into relational dat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Inf. Syst.

دوره 34  شماره 

صفحات  -

تاریخ انتشار 2007